Letters to the Editor

< Previous         Next >  
A novel strategy for deciphering dynamic conservation of gene expression relationship Free
Liyun Yuan1, Guohui Ding2,3, Y. Eugene Chen4,5, Zhe Chen6,*, and Yixue Li1,2,3,*
1School of Life Science and Biotechnology, Shanghai Jiaotong University, Shanghai 200240, China
2Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
3Shanghai Center for Bioinformation Technology, Shanghai 200235, China
4Cardiovascular Center, Department of Internal Medicine, University of Michigan Medical Center, Ann Arbor, MI 48109, USA
5Drug Discovery and Design Centre, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China
6National Clinical Research Base of Traditional Chinese Medicine, Zhejiang Hospital of Traditional Chinese Medicine, Zhejiang Chinese Medical University, Hangzhou 310006, China *Correspondence to:Yixue Li, E-mail: yxli@sibs.ac.cn; Zhe Chen, E-mail: chenzhe@zju.edu.cn
J Mol Cell Biol, Volume 4, Issue 3, June 2012, 177-179,  https://doi.org/10.1093/jmcb/mjs014

Time serial microarray or RNA-Seq experiments were frequently used in searching specific genes or biomarkers associated with carcinogenesis, and subsequently in identifying novel cancer subtypes and implementing molecular cancer diagnosis (Chung et al., 2002; Liau and Whang, 2009). Thus, developing methods to analyze time serial gene expression profiling remains ever important and is always a challenge.
Popular statistical methods, such as the Student's t-test, clustering and analysis of variance analyses, etc., are widely used in analyzing time serial gene expression data, and are successful for finding disease-related genes or biomarkers. However, there still are many general problems remaining to be solved. Some major problems in mining gene expression data include high false positives, low sensitivity and low overlap of genes found by different methods. In general, the reason for drawbacks can come from a variety of sources, but two main aspects may be behind the problems. First, almost all of the methods mentioned above used to ignore the continual changes of a gene during the whole stages of disease; secondly, they take no account of the continuous stabilities or conservative properties of time serial gene expression. In term of this, two approaches, multiclass ordinal analyses and multi-category logit model algorithm (Pyon and Li, 2009), were presented and used in finding disease-related gene signatures and biomarkers. Both methods are based on a simple hypothesis that consistently increasing/decreasing expressed genes will show concordance/discordance with cancer progressions.
However, due to the complexity and heterogeneousness of gene expression profiling, no methods can promise a perfect solution in analyzing this kind of data.